Cross-Lingual Cross-Document Coreference with Entity Linking
نویسندگان
چکیده
This paper describes our approach to the 2011 Text Analysis Conference (TAC) Knowledge Base Population (KBP) cross-lingual entity linking problem. We recast the problem of entity linking as one of cross-document entity coreference. We compare an approach where deductive entity linking informs crossdocument coreference to an inductive approach where coreference and linking judgements are mutually beneficial. We also describe our approach to cross-lingual entity linking comparing a native linking approach with an approach utilizing machine translation. Our results show that inductive linking to a native language knowledge base offers the best performance.
منابع مشابه
Cross Lingual Entity Linking with Bilingual Topic Model
Cross lingual entity linking means linking an entity mention in a background source document in one language with the corresponding real world entity in a knowledge base written in the other language. The key problem is to measure the similarity score between the context of the entity mention and the document of the candidate entity. This paper presents a general framework for doing cross lingu...
متن کاملLearning a Cross-Lingual Semantic Representation of Relations Expressed in Text
Learning cross-lingual semantic representations of relations from textual data is useful for tasks like cross-lingual information retrieval and question answering. So far, research has been mainly focused on cross-lingual entity linking, which is confined to linking between phrases in a text document and their corresponding entities in a knowledge base but cannot link to relations. In this pape...
متن کاملNeural Cross-Lingual Entity Linking
A major challenge in Entity Linking (EL) is making effective use of contextual information to disambiguate mentions to Wikipedia that might refer to different entities in different contexts. The problem exacerbates with cross-lingual EL which involves linking mentions written in non-English documents to entries in the English Wikipedia: to compare textual clues across languages we need to compu...
متن کاملMEANTIME, the NewsReader Multilingual Event and Time Corpus
In this paper, we present the NewsReader MEANTIME corpus, a semantically annotated corpus of Wikinews articles. The corpus consists of 480 news articles, i.e. 120 English news articles and their translations in Spanish, Italian, and Dutch. MEANTIME contains annotations at different levels. The document-level annotation includes markables (e.g. entity mentions, event mentions, time expressions, ...
متن کاملInteroperability of cross-lingual and cross-document event detection
We describe a system for event extraction across documents and languages. We developed a framework for the interoperable semantic interpretation of mentions of events, participants, locations and time, as well as the relations between them. Furthermore, we use a common RDF model to represent instances of events and normalised entities and dates. We convert multiple mentions of the same event in...
متن کامل